PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID PGSC0003DMP400022115
Common NameLOC102597132
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; lamiids; Solanales; Solanaceae; Solanoideae; Solaneae; Solanum
Family HD-ZIP
Protein Properties Length: 281aa    MW: 32012.3 Da    PI: 7.9281
Description HD-ZIP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
PGSC0003DMP400022115genomePGSCView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox60.33.1e-19115169256
                           T--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
              Homeobox   2 rkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                           rk+ ++tk+q  +Lee F+++++++ ++++ LAk+lgL  rqV vWFqNrRa+ k
  PGSC0003DMP400022115 115 RKKLRLTKDQSVVLEESFKEHNTLNPKQKQALAKRLGLRPRQVEVWFQNRRARTK 169
                           788899***********************************************98 PP

2HD-ZIP_I/II127.36.6e-41115204191
           HD-ZIP_I/II   1 ekkrrlskeqvklLEesFeeeekLeperKvelareLglqprqvavWFqnrRARtktkqlEkdyeaLkraydalkeenerLekeveeLre 89 
                           +kk+rl+k+q+ +LEesF+e+++L+p++K++la++Lgl+prqv+vWFqnrRARtk+kq+E+d+e Lkr++++l+een+rL+kev+eLr 
  PGSC0003DMP400022115 115 RKKLRLTKDQSVVLEESFKEHNTLNPKQKQALAKRLGLRPRQVEVWFQNRRARTKLKQTEVDCELLKRCCENLTEENRRLQKEVQELR- 202
                           69*************************************************************************************9. PP

           HD-ZIP_I/II  90 el 91 
                           +l
  PGSC0003DMP400022115 203 AL 204
                           55 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PfamPF046183.1E-16187IPR006712HD-ZIP protein, N-terminal
Gene3DG3DSA:1.10.10.603.1E-18100169IPR009057Homeodomain-like
SuperFamilySSF466891.42E-18107172IPR009057Homeodomain-like
PROSITE profilePS5007117.54111171IPR001356Homeobox domain
SMARTSM003891.5E-16113175IPR001356Homeobox domain
PfamPF000461.2E-16115169IPR001356Homeobox domain
CDDcd000867.29E-15115172No hitNo description
PRINTSPR000311.3E-5142151IPR000047Helix-turn-helix motif
PROSITE patternPS000270146169IPR017970Homeobox, conserved site
PRINTSPR000311.3E-5151167IPR000047Helix-turn-helix motif
SMARTSM003401.2E-26171214IPR003106Leucine zipper, homeobox-associated
PfamPF021834.9E-11171205IPR003106Leucine zipper, homeobox-associated
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0005634Cellular Componentnucleus
GO:0003700Molecular Functiontranscription factor activity, sequence-specific DNA binding
GO:0043565Molecular Functionsequence-specific DNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 281 aa     Download sequence    Send to blast
MLENQDLRLS LSLSFSENKT TNPLQLNSWI GSFPSSDRNL EKCRTFLKGI DVNIIPTITE  60
EEEEEVGVYS PNSSISTLSG NKRNEREIIN CCEELEIERE CSRSISDEED GETSRKKLRL  120
TKDQSVVLEE SFKEHNTLNP KQKQALAKRL GLRPRQVEVW FQNRRARTKL KQTEVDCELL  180
KRCCENLTEE NRRLQKEVQE LRALKLSPQF YMQMTPPTTL TMCPSCERVA GPSTPSTSGA  240
ASVDARANQM VLARQRPVPF NLWTASPVPH RPINALHPRS *
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1113119SRKKLRL
2163171RRARTKLKQ
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankHG9754471e-145HG975447.1 Solanum pennellii chromosome ch08, complete genome.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_006358337.10.0PREDICTED: homeobox-leucine zipper protein HAT4
SwissprotQ054668e-94HAT4_ARATH; Homeobox-leucine zipper protein HAT4
TrEMBLM1AXL40.0M1AXL4_SOLTU; Uncharacterized protein
STRINGPGSC0003DMT4000325460.0(Solanum tuberosum)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
AsteridsOGEA11182485
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G16780.13e-83homeobox protein 2
Publications ? help Back to Top
  1. Xu X, et al.
    Genome sequence and analysis of the tuber crop potato.
    Nature, 2011. 475(7355): p. 189-95
    [PMID:21743474]